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Abstract 



Quantum mechanics is derived as an apphcation of the method of 
maximum entropy. No appeal is made to any underlying classical action 
principle whether deterministic or stochastic. Instead, the basic assump- 
tion is that in addition to the particles of interest x there exist extra 
variables y whose entropy S{x) depends on x. The Schrodinger equation 
2 , follows from their coupled dynamics: the entropy S{x) drives the dynam- 

r^ ' ics of the particles x while they in their turn determine the evolution of 

^^1 S{x). In this "entropic dynamics" time is introduced as a device to keep 

track of change. A welcome feature of such an entropic time is that it 
naturally incorporates an arrow of time. Both the magnitude and the 
phase of the wave function are given statistical interpretations: the mag- 
nitude gives the distribution of x in agreement with the usual Born rule 
w-s ' and the phase carries information about the entropy S{x) of the extra 

rf^ , variables. Extending the model to include external electromagnetic fields 

fSJ ' yields further insight into the nature of the quantum phase. 

§ ■ 1 Introduction 

The discoveries of black hole entropy [T] and thermodynamics [2 , and its con- 
nection to quantum theory |3] suggest a deep connection between the funda- 
mental laws of physics and information. The details of the connection are not 
^ I yet known but one possibility is worth exploring: perhaps what we call physics 

is the very useful framework that has been gradually developed to process in- 
formation and make inferences about nature. From this perspective one might 
expect that the actual rules for processing information — probability theory and 
entropic methods — should play central roles in the laws of physics. There is at 
least one example where we know this is true: statistical mechanics and ther- 
modynamics have been derived as an application of the method of maximum 
entropy [1]. 

Our goal is to derive quantum theory as an application of entropic inference. 

One important difference with other approaches that also emphasize notions 
of information (see e.g., [5]-|16j) is the privileged role we assign to position 
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over and above all other observables. Our emphasis on position leads to formal 
similarities with Nelson's stochastic mechanics [T7]-|24). 

Both the entropic dynamics developed here and Nelson's stochastic mechan- 
ics derive quantum theory as a kind of non-dissipative Brownian motion, but 
there are important conceptual differences. First, stochastic mechanics oper- 
ates at the ontological level; its goal is to attain a realistic interpretation of 
quantum theory as arising from a deeper, possibly non-local, but essentially 
classical reality. Entropic dynamics, however, operates almost completely at 
the epistemological level; the emphasis is on making predictions on the basis 
of limited information. A second difference is that stochastic mechanics re- 
quires two assumptions — the existence of a universal Brownian motion and that 
the current velocity is the gradient of some scalar function — which in entropic 
dynamics are derived and not merely postulated. Yet a third difference is that 
stochastic mechanics is somewhat closer in spirit to Smoluchowski's approach to 
the theory of Brownian motion which involves keeping track of the microscopic 
details of molecular collisions through a stochastic Langevin equation and then 
taking suitable averages. Entropic dynamics, on the other hand, is closer to the 
Einstein approach and focuses on those pieces of information that turn out to 
be directly relevant for the prediction of macroscopic effects. The advantage of 
the latter approach is the simplicity that arises from not having to keep track 
of irrelevant details that are eventually washed out in the averages. 

There exist other derivations of the Schrodinger that, like the theory pro- 
posed here, do not appeal to the notion of a "quantum" probability. Some share 
with Nelson's approach the foundation of an underlying classical dynamics with 
an additional stochastic element. The sub-quantum dynamics is variously de- 
scribed by a classical action principle, or a Liouville equation, or Newton's law. 
The additional stochastic element has been introduced in a variety of ways: 
through an extra momentum fluctuation |25j | 26 ) : a hidden non-equilibrium 
thermodynamics [27j : Brownian fluctuations caused by energy exchanges with 
the surrounding vacuum [5S]; coarse graining an underlying dynamics that is 
reparametrization- invariant and ergodic '3IJ' ; tracing out certain inaccessible de- 
grees of freedom [21] ; through explicit dissipation HH] ; and also as the statistical 
mechanics of a particular class of matrix models |35]. In contrast, the entropic 
dynamics proposed here does not assume any underlying dynamics whether clas- 
sical, deterministic, or stochastic. Both quantum theory and its classical limit 
are derived as examples of entropic inference. 

The statistical model is described in section 2. The basic assumption is that 
in addition to the particles of interest there exist some extra variables. In section 
3 we address the basic dynamical question: given that particles move from an 
initial point in configuration space to some other point in its vicinity, where will 
we expect to find them? When approached as an example of inference the prob- 
lem is to select a probability distribution from within a family specified through 
appropriate constraints. The answer is given by the method of maximum en- 
tropy (ME) and the most probable new distribution is that which maximizes an 
entropy (see e.g. |33j). The resulting entropic dynamics is very simple: except 
for fluctuations leading to diffusion the particles tend to drift along the gradi- 



ent of the entropy of the extra variables. (Related work on entropic dynamics 
appeared in [34]-[38].) 

The problem of keeping track of how a succession of small changes builds 
up into a large change requires the introduction of a suitable notion of time. 
The notions of instant, duration, and the directionality of entropic time are 
the subjects of sections 4. Later, in section 8, we discuss the relation between 
entropic time and the presumably more fundamental notion of "physical" time 
and argue that the latter is not needed. Our approach is compatible with the 
theory of dynamical time advocated by J. Barbour in the context of classical 
physics (see e.g. [55]). 

The dynamics obtained in this way is standard diffusion and is described 
by a Fokker-Planck equation (section 5) but quantum mechanics is not just 
diffusion. In addition to a probability distribution p{x, t) from which we can 
generate the magnitude of a wave function we need a second degree of freedom 
that would be associated with its phase. The missing ingredient is supplied in 
section 6. 

Entropic dynamics resembles general relativity in one important respect. 
According to general relativity the geometry of space-time dictates how matter 
must move, and matter reacts back and dictates how the geometry must change 
in response. The lesson of general relativity, if there is one at all, is that there 
is no fixed background: space is a dynamical entity. The situation in entropic 
dynamics is somewhat analogous: The probability distributions for the extra 
variables constitute a statistical manifold, and it is this space that dictates how 
the distribution p{x, t) diffuses. To the extent that the statistical manifold is 
kept frozen one obtains a fairly standard diffusion. Quantum dynamics arises 
when we allow the distribution p{x, t) to react back on the statistical manifold. 
Once the manifold is promoted to a dynamical entity we have a second degree 
of freedom — the entropy of the extra variables — which can be codified into the 
phase of a wave function. The dynamics of the manifold is specified, following 
Nelson [TH] , by requiring that a suitable quantity, which will be called "energy" , 
be conserved and time-reversal invariant (see also J55^ ) . This step completes the 
derivation of the Schrodinger equation. 

The analogy to general relativity is further expanded in section 7 with the 
introduction of a quantum analogue of the gravitational equivalence principle. 
The entropic approach throws new light on old issues such as the linearity 
and unitarity of the Schrodinger equation, the central role played by complex 
numbers, and provides a statistical interpretation not just of the magnitude of 
the wave function but also of its phase. In section 9 the model is extended to 
account for external electromagnetic fields and the corresponding gauge/phase 
transformations which provide further insight into the nature of the quantum 
phase. Final conclusions are collected in section 10. 



2 The statistical model 

Consider particles living in flat three-dimensional space. For a single particle 
the configuration space X is Euclidean with metric 

5ab (^. 

^ab = ^ ■ (1) 

(The reason for the scale factor ct^ will become clear once we generalize to 
N particles.) Our main assumption is that in addition to the particles there 
exist some extra variables that live in a space y and are subject to uncertainty. 
The number and nature of the extra variables y (z y and the origin of their 
uncertainty need not be specified — it is a strength of this formulation that our 
conclusions hold irrespective of any assumptions about the y variables. We only 
need to assume that their uncertainty depends on the location x of the particles 
and this is described by some probability distribution p{y\x). As we shall see it 
is the entropy of the distributions p{y\x) that plays a significant role in defining 
the dynamics of x; the finer details oi p{y\x) turn out to be irrelevant. 

For a single particle the statistical manifold M of distributions p(i/\x) is 
three-dimensional: for each x there is a corresponding p{y\x). Each distribution 
p{y\x) G A4 can be conveniently labeled by its corresponding x so that the 
label X denotes both a regular point in the configuration space X and also its 
corresponding "point" in the statistical manifold A4. For later reference, the 
entropy S{x) of p(y\x) relative to an underlying measure q{y) of the extra- 
variable space y isQ 

S{x) = - jdyp{y\x)\og^-^. (2) 

This entropy S{x) is a natural scalar field on both the configuration space X 
and the statistical manifold M.. 

The peculiar features of quantum mechanics such as non-local correlations 
and entanglement will arise naturally provided the theory for N particles is 
formulated on the 3A^-dimensional configuration space Xm- Accordingly, to 
complete the specification of the model we need to describe X^ and its corre- 
sponding statistical manifold M.n- The generalization is straightforward. For 
A'^ particles the extra- variable distributions are p{y\x) where now the position 
X G Xm is given by x and the index A now takes ZN values. More explicitly 
X = (cc^^a;"^ . . .) where ai — 1,2,3 denotes the first particle, a2 = 4,5,6 de- 
notes the second particle, and so on. The 3A^-dimensional configuration space 
Xm remains flat but it is not, in general, isotropic. For example, for N = 2 
particles the metric, written in block matrix form, is 



lAB 



Saib J <7i 



Sa2b2/'^'^ 



(3) 



^This is a multidimensional integral over all y variables; for simplicity we write dy instead 
of d"y. 



We shall later see that this choice of an anisotropic configuration space leads to 
a theory of particles with different masses. For particles that arc identical the 
appropriate configuration space is isotropic with ai — (J2 ~ ■■■ ^ cr. 

To summarize, the first basic assumption is the existence of some extra 
variables y subject to an x-dependent uncertainty described by some unspecified 
distributions p{y\x). The statistical manifold A4n and the entropy field S{x) 
are convenient inference tools introduced to explore the implications of this 
assumption. 

3 Entropic dynamics 

The second basic assumption is that small changes from one state to another 
are possible and do, in fact, happen. We do not explain why they happen but, 
given the information that changes occur, our problem is to venture a guess 
about what changes to expect. Large changes are assumed to result from the 
accumulation of many small changes. 

Consider a single particle (the generalization to several particles is immedi- 
ate) that moves away from an initial position x to an unknown final position 
x'. All we know about x' is that it is near x. What can we say about a;'? 
Since x and x' represent probability distributions we see that this is precisely 
the kind of problem the method of maximum entropy (ME) has been designed 
to solve, namely, to update from a prior distribution to a posterior distribution 
selected from within a specified seto As in all ME problems success hinges on 
appropriate choices of the entropy, prior distribution, and constraints. 

Since neither the new x' nor the new extra variables y' are known what we 
want is the joint distribution P{x',y'\x) and the relevant space is X xy. To 
find it maximize the appropriate (relative) entropy, 

SlP.Q]^-I^VPi.Wl^no,g^. (4) 

The relevant information is introduced through the prior Q{x\y'\x) and the 
constraints that specify the family of acceptable posteriors P{x', y'\x). 

We select a prior that represents a state of extreme ignorance: the relation 
between x' and y' is not known; knowledge of x' tells us nothing about y' and vice 
versa. Such ignorance is represented by a product, Q{x', y'\x) = Q{x'\x)Q{y'\x). 
Furthermore we take the distributions Q{y'\x)dy' and Q{x'\x)d'^x' to be uniform, 
that is, proportional to the respective volume elements which are respectively 
given by dv^ — "f^^'^d'^x [where 7 — det7„5, see eq.dl])] and by dvy — q{y)dy 
where the measure q{y) need not be specified further. Therefore, since propor- 
tionality constants are not essential here, the joint prior is 

Q{x',y'\x)=j^/\{y). (5) 



^The notion of updating from a prior to a posterior distribution is standard in Bayesian 
inference; it is also appropriate in the context of entropic inference. For a detailed account 
of the use of relative entropy as a tool for updating which includes Bayesian updating as a 
special case see |33| . 



Next we specify the constraints. Write the posterior as 

P{x\y'\x)=P{x'\x)P{y'\x',x) (6) 

and consider the two factors separately. First we require that x' and y' be related 
to each other in a very specific way, namely that P{y'\x' , x) = p{y'\x') G M — the 
uncertainty in y' depends only on x' , and not on previous positions x. Therefore, 
our first constraint is that the joint posterior be of the form 

P{x',y'\x)^P{x'\x)p{y'\x'). (7) 

The second constraint concerns the factor P{x'\x) and represents the fact that 
actual physical changes do not happen discontinuously: we require that x' be 
an infinitesimally short distance away from x. Let x'° — x°- + Ax"". We require 
that the expectation 

(A£2(x', x)) = {j^^Ax'^Ax") = AP (8) 

be some small but for now unspecified numerical value A^^ which could in 
principle depend on x. 

Having specified the prior and the constraints the ME method takes over. 
Substituting the prior ([S|) and the constraint ([7]) into the joint entropy (j3]) gives 



s[p, Q] = - f dx' p{x'\x) log ^^^^jr- + / ^^' ^(^' 



x)S{x') , (9) 



where S(x) is given in eq.Q. Next we vary P(x'\x) to maximize S[P, tt] subject 
to ((HI) and normalization. The result is 

C{x,a) 
where 

C(x, a) = /" dx' e'S(-')-|a(-)A^^(x',x) ^ (^^-^^ 

and the Lagrange multiplier a{x) is determined from the constraint eq.([8]), 

^logC(x,a) = -iAF. (12) 

The distribution (|10p is not merely a local maximum or a stationary point, 
it yields the absolute maximum of the relative entropy S[P,Q] subject to the 
constraints ([7]) and ([5]). The proof (see the appendix) follows the standard 
argument originally due to Gibbs |40| . 

The probability of a step from x to x' , ea. (|10l) . represents a compromise 
between three confiicting tendencies. One, which can be traced to the uniform 
prior Q{x'\x) — 7^/^ and is represented by the first integral in (|9]), is to make 
P{x'\x) spread as uniformly as possible. Another, induced by the second integral 
in ([9]), contributes the entropy term in the exponent of P{x'\x) and favors a 



single giant step to the distribution p{y'\x') that maximizes the entropy S{x'). 
And last, the constraint on l^/S.ti'^) leads to the A£^(a:',a:) term in the exponent 
of P{x'\x) and favors values of x' that are close to x. Large a means short steps. 
The compromise in eq. dTU]) leads to short steps in essentially random directions 
with a small anisotropic bias along the entropy gradient. 

For large a let a;'" — x"" + Aa;°. Expanding the exponent in PH)) about its 
maximum gives 



P{.x'\x) « ——eyip 
Z{x) 



'^Sab (Ax^ - Ar) {Ax' - Ax') 



(13) 



where factors independent of x' have been absorbed into a new normalization 
Z{x). The displacement Ax°- can be expressed as the expected drift plus a 
fluctuation, 

Ax" = Ax" + Aw" , (14) 

where 

2 

{Ax") = Ax" = ^5"'dbS{x) , (15) 

a[x) 

2 

(Aw") = and (Aw" Aw') = -^S"' . (16) 

^ ' a(x) 

The particle tends to move along the entropy gradient. Note that as a — >■ oo the 
steps get correspondingly smaller but the fluctuations become dominant: the 
drift is Ax ~ 0{a^^) while the fluctuations are much larger Aw '-^ 0{a^^^^). 
This implies that as a — >■ oo the trajectory is continuous but not difFcrcntiable — 
just as in Brownian motion. 

We can now return to the unfinished business of choosing A£^ in eq.® 
which is equivalent to choosing the multiplier a{x). We invoke a symmetry 
argument. We just saw that in the limit of infinitesimally short steps the relevant 
dynamics is dominated by the fluctuations Aw. In order that the dynamics 
reflect the translational symmetry of the configuration space X we choose a{x) 
so that the fluctuations (Aw" Aw') in eq. (fT6|) be independent of x. Therefore 
a{x) = constant. 

4 Entropic time 

Our goal is to derive laws of physics as an application of inference methods but 
the latter make no reference to time so additional assumptions are needed. The 
foundation to any notion of time is dynamics. We introduce time as a convenient 
book-keeping device to keep track of the accumulation of small changes. 

In this section we show how a dynamics driven by entropy naturally leads to 
an "entropic" notion of time. Our task here is to develop a model that includes 
(a) something one might identify as an "instant", (b) a sense in which these 
instants can be "ordered" , (c) a convenient concept of "duration" measuring the 
separation between instants. A welcome bonus is that the model incorporates an 



intrinsic directionality — an evolution from past instants towards future instants. 
Thus, an arrow of time does not have to be externally imposed but is generated 
automatically. This set of concepts constitutes what we will call "entropic time" . 
Important questions such as the relation between entropic time, in which 
instants are ordered through the sequence of inference steps, and an externally 
imposed structure of a presumably "physical" time will be discussed later (sec- 
tion 8) after the dynamics has been more fully developed. 

4.1 Time as a sequence of instants 

In entropic dynamics change is given, at least for infinitesimally short steps, by 
the transition probability P{x'\x) in eg. dTSl) . For finite steps the relevant piece 
of information is that large changes occur only as the result of a continuous 
succession of very many small changes. 

Consider the nth step. In general we will be uncertain about both its initial 
and the final positions, x and x' . This means we must deal with the joint 
probability P{x' ,x). Using P{x',x) — P{x'\x)P{x) and integrating over a;, we 
get 

P{x') = Jdx P{x'\x)P{x) . (17) 

It is important to emphasize that this equation is a direct consequence of the 
laws of probability — no assumptions of a physical nature have been made. How- 
ever, if P{x) happens to be the probability of different values of x at a given 
instant of entropic time t, then it is tempting to interpret P{x') as the probabil- 
ity of values of x' at a "later" instant of entropic time t' = t + At. Accordingly, 
we write P{x) — p{x,t) and P{x') = p{x',t') so that 

p{x', t') ^ J dx P{x'\x)p{x, t) (18) 

Nothing in the laws of probability that led to eq. pT)) forces this interpretation 
on us — this is an independent assumption about what constitutes time in our 
model. We use eq. (|18p to define what we mean by an instant: if the distribu- 
tion p{x,t) refers to an "initial" instant, then the distribution p{x' ,t') defines 
what we mean by the "next" instant. Thus, ea.(|18p allows entropic time to be 
constructed, step by step, as a succession of instants. 

4.2 Duration: a convenient time scale 

Having introduced the notion of successive instants we now have to specify the 
interval At between them. Successive instants are connected through the tran- 
sition probability P{x'\x). Specifying the interval of time At between successive 
instants amounts to tuning the steps or, equivalently, the multiplier a(x,t). To 
model a time that, like Newtonian time, flows "equably" everywhere, that is, at 
the same rate at all places and times we define At as being independent of x, 
and such that every At is as long as the previous one. Inspection of the actual 
dynamics as given in eq. (|13m5)) shows that this is achieved if we choose a(x, t) 



so that 

T 

a{x,t) — — — = constant , (19) 

where r is a constant introduced so that At has units of time. As aheady 
anticipated in the previous section, it is the translational symmetry of the con- 
figuration space X expressed as the "equable" flow of time that leads us to 
impose uniformity on the expected step sizes A£ and the corresponding mul- 
tipliers a. This completes the implementation of entropic time. In the end, 
however, the only justification for any definition of duration is that it simpli- 
fies the description of motion, and indeed, the transition probability in ea. (jl3p 
becomes 

which we recognize as a standard Wiener process. A displacement Ax ~ x' — x 
is given by 

Aa;" = 6"(x)Ai + Aw" , (21) 

where the drift velocity h°'{x) and the fluctuation Aw"" are 

{Ax") = b'^At with 6°(x) = — (5"^9hS'(a;) , (22) 

r 

2 

{Aw") = and {Aw" Aw'') = —At 5"'' . (23) 

The constant a^ /2t plays the role of the diffusion constant in Brownian motion. 
The formal similarity to Nelson's stochastic mechanics |17) is evident. An im- 
portant difference concerns the expression of the drift velocity as the gradient 
of a scalar function: unlike stochastic mechanics, here ea.(P^ has been derived 
rather than postulated, and S{x) is not merely an uninterpreted auxiliary scalar 
function — it turns out to be the entropy of the y variables. 

4.3 The directionality of entropic time 



Time constructed according to eq.(|T8|) is remarkable in yet another respect: the 
inference implied by P{x'\x) in ea. (jl3p incorporates an intrinsic directionality 
in entropic time: there is an absolute sense in which p{x, t) is prior and p{x' , t') 
is posterior. 

Suppose we wanted to find a time-reversed evolution. We would write 

p{x, i) = / dx' P{x\x')p{x', t') . (24) 

This is perfectly legitimate but in order to be correct P{x\x') cannot be ob- 
tained from eq. (jl3p by merely exchanging x and x' . According to the rules of 
probability theory P{x\x') is related to eg. dTSl) by Bayes' theorem, 

P{xW) = |^^(-^» ■ (25) 



In other words, one of the two transition probabihties, either P{x\x') or P(x|a;'), 
hut not both, can be given by the maximum entropy distribution eq. (|13p . The 
other is related to it by Bayes' theorem. I hesitate to say that this is what 
breaks the time-reversal symmetry because the symmetry was never there in 
the first place. There is no symmetry between prior and posterior; there is no 
symmetry between the inferential past and the inferential future. 

An interesting consequence of the time asymmetry is that the mean velocities 
towards the future and from the past do not coincide. Let us be more specific. 
Equation (|22p gives the mean velocity to the future or future drift, 

{x<'{t + M)).^.-x<'{t) 
b'^ix) = lim — -P^ — 

lim -— fd2;'P(x'|2:)Ax° (26) 

where x = x{t) , x' — x{t + At), and Ax" = cc'" — x"". Note that the expectation 
in ([^5]) is conditional on the earlier position x = x{t). One can also define a 
mean velocity from the past or past drift, 

x°'{t)-{x''{t-M)).,. 
bt{x)^ lim —- --^ 11^ 27) 

where the expectation is conditional on the later position x = x(t). Shifting the 
time by At, 6" can be equivalently written as 

a;°(t + At)-(a;'^(t))^,J,^J^ 
*^ ' At^o+ At 

lim -^ fdxP(x\x')Ax'' , (28) 

At^o+ At-' V I ; : \ J 



with the same definition of Aa;° as in eq. 

The two mean velocities, to the future 6°, and from the past 6°, do not 
coincide. The connection between them is well known [T7]|19). 



b:{x,t) - 6" (a;) - — a"logp(a;,t) , (29) 

T 

wherqfl d"' = (5° db and p{x,t) — P{x). What might not be widely appreciated 
is that eg. d^ is a straightforward consequence of Bayes' theorem, eq. (^5]) . 
(For a related idea see [41].) To derive ea.p9)) expand P{x') about x in (|25|) to 
get 



P{x\x') 



\^dlogPix,t)^^, 
ox" 



P{x'\x). (30) 



Multiply 6J(a;') in eq. ([28l) by a smooth test function f{x') and integrate, 

/ dx' hl{x')f{x') = l-Jdx'Jdx P{x\x')Ax''fix') . (31) 



^Prom now on indices are raised and lowered with the EucUdean metric <5„ 
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(The limit Af -^ O"*" is understood.) On the right hand side expand f{x') about 
X and use (l30l) . 

^dlog p{x,t) Aa.rdf 



^Jdx'JdxPix'lx) 



Ax''fix)-Ax''Ax'' T; ' ' fix) + Ax'' Ax ^ 

^ ' dx'' ^ ' dx 



(32) 
Next interchange the orders of integration and take Ai — > 0+ using eq. p3| , 



(Ax^Ax") = (Aiu'^Au;'') = — AtiJ"" . (33) 

T 

On integration by parts the third term of ([32]) vanishes and we get 



^dxK{x)f{x)^jdx 



^2 



6-(a;)-_J-''a,logp(x,t) 



T 



f{^) , (34) 



Since f{x) is arbitrary we get (|29|) . 

The puzzle of the arrow of time has a long history (see e.g. [42]). The 
standard question is how can an arrow of time be derived from underlying 
laws of nature that are symmetric? Entropic dynamics offers a new perspective 
because it does not assume any underlying laws of nature - whether they be 
symmetric or not - and its goal is not to explain the asymmetry between past 
and future. The asymmetry is the inevitable consequence of entropic inference. 
From the point of view of entropic dynamics the challenge does not consist 
in explaining the arrow of time, but rather in explaining how it comes about 
that despite the arrow of time some laws of physics turn out to be reversible. 
Indeed, even when the derived laws of physics — in our case, the Schrodinger 
equation — turns out to be fully time-reversible, entropic time itself only flows 
forward. 

5 Accumulating changes: the Fokker-Planck equa- 
tion 

Time has been introduced as a useful device to keep track of the accumulation 
of small changes. The technique to do this is well known from diffusion theory 
|43| . The equation of evolution for the distribution p{x, t), derived from ea. (jl8p 
together with ([1T])-([21]), is a Fokker-Planck equation (FP), 

2 

dtp = ^da [b'^p) + ^VV , (35) 

where da = d/dx°' and V = (5° d^ /dx°'dx^ . The FP equation can be rewritten 
as a continuity equation, 

dtp = -da («V) , (36) 

where the velocity of the probability flow or current velocity is 

v^'^b''- —S''^^ . (37) 

2t p ^ ^ 
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It is convenient to introduce the osmotic velocity 



u^ = -^d^\ogp. (38) 



Its interpretation follows from v°' = fc^+u". The drift 6°, eq.(|22|). represents the 
tendency of the probability p to flow up the entropy gradient while u"^ represents 
the tendency to flow down the density gradient. The situation is analogous to 
Brownian motion where the drift velocity is the response to the gradient of an 
external potential, while m° is a response to the gradient of concentration or 
chemical potential — the so-called osmotic forced The osmotic contribution to 
the probability flow is the actual diffusion current, 

pu- = -^a> , (39) 

Zt 

which can be recognized as Pick's law, with a diffusion coefficient given by 
aV2T. 

Since both the future drift 6° and the osmotic velocity u" are gradients, it 
follows that the current velocity is a gradient too. For later reference, from (|22p 
and dMl), 

2 

w« = — d^cj) , (40) 

T 

where 4>{x,t) — S{x) —\ogp^^'^{x,t). 

With these results entropic dynamics reaches a certain level of completion: 
We figured out what small changes to expect — they are given by P{x'\x) — and 
time was introduced to keep track of how these small changes accumulate; the 
net result is diffusion according to the FP equation (|35|). 

6 Manifold dynamics 

But quantum mechanics is not just diffusion. The description so far has led us 
to the density p{x, t) as the important dynamical object but to construct a wave 
function, ^ = p^/^e'"^, we need a second degree of freedom, the phase 0. The 
problem is that as long as the geometry of the statistical manifold A4 is rigidly 
fixed there is no logical room for additional degrees of freedom. The natural 
solution is to remove this constraint. We allow the manifold Ai to participate 
in the dynamics and the entropy of the y variables becomes a time-dependent 
field, S{x,t). We can take S{x,t) to be the new independent degree of freedom 
but eq.(|40|) suggests that a more convenient and yet equivalent choice is 

<f>{x,t) ^S{x,t)~ log p'/'{x,t). (41) 

Thus the dynamics will consist of the coupled evolution of p{x,t) and (j){x^t). 

*The definition of osmotic velocity adopted by Nelson _i7 and other authors differs from 
ours by a sign. Nelson takes the osmotic velocity to be the velocity imparted by a force applied 
externally in order to balance the osmotic force (due to concentration gradients) and attain 
equilibrium. For us the osmotic velocity is the velocity imparted by the osmotic force itself. 
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6.1 Conservative diffusion 

To specify the dynamics of the manifold M we follow Nelson [TB] and impose 
that the dynamics be "conservative," that is, one requires the conservation of a 
certain functional E[p,S] oi p{x,t) and S{x,t). 

Requiring that some "energy" E[p, S] be conserved may seem natural be- 
cause it clearly represents relevant information but it is an assumption that 
cries out for a deeper justification. Normally energy is whatever happens to 
be conserved as a result of invariance under translations in time. But our dy- 
namics has not been defined yet; what, then, is "energy" and why should it be 
conserved in the first place? This is a question best left for the future. At this 
early stage, for the purpose of deriving a non-relativistic model, we just propose 
an intuitively reasonable conserved energy and proceed. 

The particular form of E[p, S] is chosen to be a local functional that is invari- 
ant under time reversal and we require that the velocities enter in rotationally 
invariant terms |24| . Under time reversal i — ^ — i we have 

b" ^ ~bt , v" ^ -v" , u" -^u", (42) 

In the low velocity limit this means we need only include velocity terms in w^ 
and w^. The proposed energy functional is 

E[p, S]=J d?x p{x, t) [A^^.v-v'^ + Bj^.u-u" + V{x)) , (43) 

where A and B are constants, 7^^, is given by ([I}, and V{x) represents an 
external potential. If E has units of energy then A/a'^ and B/a^ have units of 
mass. Let us define new constants 

2A , 2B 

-^ and p = — 
a'^ a 



m = -^ and p = — ^ , (44) 



which we will call the "current mass" and the "osmotic mass" . Then the energy 
functional is 

E[p,S]^ f d^xp{x,t)(^mv^ + ^pu^ + Vix)j . (45) 

It is further convenient to combine the constant r, which sets the units of time, 
with A into yet a new constant rj, 

77 = — so that — ~ — . (46) 

T T ra 

rj relates the units of mass or energy with those of time. Then the current and 
osmotic velocities, eqs. (l40)) and (l38t are 



mva = rida<p and mua — —rjda log p ' , (47) 

and the energy (j45|) becomes 



E[p,S]^ I dxp(^{daCf>)' + ^{da\0gp)' + V] . (48) 
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When the potential is static, V — 0, energy is conserved, E ~ 0. Otherwise 
we impose that energy increase at the rate 

E= I dxpV . (49) 

Next take the time derivative of (|48l) . After integrating by parts, and using 
eqs.dSe]) and (gO]), 



p = -da (pf") = --d^ {pda(t>) = -- {d'^pda^ + pV^ 4>) , (50) 

m m 



we get 

E-Jd.pV^Jd.p\,c^+l-ida^)^ + V-^^ 
The left hand side vanishes for arbitrary choices of p provided 



(51) 



„^ + |_,a„„» + V-i^^^O, (52) 



Equations (l50|) and ([52|) are the coupled dynamical equations we seek. They 
describe entropic diffusion and energy conservation. The evolution of p{x,t), 
eq. ([5n|) . is guided by 4'{x,t); the evolution of 4'{x,t), ea. ((5^ . is determined by 
p(x, t). The evolving geometry of the manifold Ai enters through 4>{x, t). 

6.2 Classical limits 

Before proceeding further we note that writing Srj — 'n4> in equations (I47p and 
(|52p and taking the limit 77 — >■ with Shj, Tn, and p, fixed leads to 



mva = da Shj and u^ = , (53) 

and to the Hamilton-Jacobi equation 

SHj + :^{daSHjf + V = 0. (54) 

2m 

This suggests that the constant m be interpreted as the inertial mass. Further- 
more, eq. (f22l) tells us that the particle is expected to move along the entropy 
gradient, while eq. ([23l) . 

(Aw") = and (Aw'' Aw'') = ^AtS"'' -^ , (55) 

says that the fluctuations about the expected trajectory vanish. We conclude 
that in the limit 77 — > entropic dynamics reproduces classical mechanics with 
classical trajectories following the entropy gradient. A similar classical limit can 
also be attained for fixed rj provided the mass m is sufficiently large. 
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The limit // -t- for fixed 77, Shj, and m is also interesting. This situation 
is also ruled by the classical Hamilton- Jacobi equation ((54|) . but the osmotic 
velocity does not vanish, 

mva= daSnj and mua = rjda^ogp^^^ . (56) 

The expected trajectory also lies along a classical path but now, however, it 
does not coincide with the entropy gradient. More important perhaps is the 
fact that the fluctuations Aw" about the classical trajectory do not vanish. 
The limit /j, ^' is a different "classical" limit; whether it corresponds to an 
actual physical situation remains to be seen. 

6.3 The Schrodinger equation 

Next we show that, with one very interesting twist, the dynamical equations 
([50]) and ((5^ turn out to be equivalent to the Schrodinger equation. We can 
always combine the functions p and (/> into a complex function 

* = pi/2 exp(i(/)) . (57) 

Computing its time derivative, 

*-(t-+«<^)*' (58) 



V^f^**)!/^ 



.2p 
and using eqs. ([50|) and (f52|) leads to 






2m 2m V m/ (^**)i/2 ^ 

This reproduces the Schrodinger equation. 



ihr^ = -^—\/H + V^, (60) 



provided the current and osmotic masses are equal, m = p,, and ry is identified 
with Planck's constant, r/ = h. 

But why should the osmotic mass be precisely equal to the inertial mass? 
Why can't we say that entropic dynamics predicts a non-linear generalization of 
quantum theory? This question is so central to quantum theory that we devote 
the next section to it. But before that we note that the non-linearity is un- 
desirable both for experimental and theoretical reasons. On one hand, various 
types of non-linearities have been ruled out experimentally to an extreme degree 
through precision experiments on the Lamb shift 22 and even more so in hyper- 
fine transitions [4T. On the other hand, from the theory side it is the fact that 
time evolution preserves linear superpositions that leads to the superposition 
principle and makes Hilbert spaces useful. In addition, there is a consistency 
argument that links the linearity of the Hilbert space and the linearity of time 
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evolution [TU] . Retaining one and not the other leads to inconsistently assigned 
amplitudes showing that the very concept of quantum amplitudes is a reflection 
of linearity. And, as if that were not enough, it has also been shown that in 
the presence of non-linear terms entangled particles could be used to achieve 
super luminal communication |45| . Therefore it is extremely probable that the 
identity of inertial and osmotic mass is exact. 

There is another mystery in quantum theory — the central role played by 
complex numbers — which turns out to be related to these issues. The dynamical 
equations ([50]) and ([5^ contain no complex numbers. It is true that they contain 
two degrees of freedom p and 4> and that these two can be combined into a single 
complex number VP = p^ "e"''. This is a triviality, not a mystery: the dynamical 
equations can always be reformulated into an equation for ^ and its conjugate 
\['*. The statement that complex numbers play a fundamental role in quantum 
theory is the non-trivial assertion that the equation of evolution contains only 
^ and not \1/ and also its conjugate \1/*. In the entropic approach both the linear 
time evolution and the special role of complex numbers are linked through the 
equality m = ^l. 

7 A quantum equivalence principle 

The generalization to N particles is straightforward. As indicated at the end 
of section 2, the configuration space has 3A^ dimensions and the system is rep- 
resented by a point x ^ x^ ^ {x°-^ ,x°''^ . . .). The corresponding Fokker-Planck 
equation [see eqs.©, (155)) and PO)) ] is 



^ ..AB' 



N 
dtp = --^"""dAipdB^) = -Y.da^ (P<") (61) 



where 4){x,t) is given by ea. (l4ip . The current and osmotic velocities are 

^2 „2 



Vn=- 



d^-cj) and <" = -^9°"logpi/2 ^ (62) 

T 

and the conserved energy is 

E= f d^'^x p{x, t) {Aj^bV^v^ + B-fj^gU^u^ + V{x)) . (63) 



Introducing the inertial (or current) and osmotic masses. 



2^ , 2S 

-^ and /!„ = -^ , (64) 



and the constant rj — 2A/t, eqs. (|6T|) and (|63| become 



5tP=-E— 5a„(p9""0) , (65) 
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E[p,S]=Jd'''xp(E[^{da,Af + ^{daJogpf]+V{x)^ . (66) 

Imposing, as before, that E — J pV = for arbitrary choices of p leads to the 
modified Hamilton- Jacobi equation. 

Finally, the two eqs. (j65p and (|67p can be combined into a single equation for 
the complex wave function, ^I* = p^/^e*"^, 

Eq. (j64p shows that the ratio of osmotic to inertial mass turns out to be a 
universal constant, the same for all particles: /i„/TO„ — B/A. This can be traced 
to a choice of energy that reflects the translational and rotational symmetries 
of the configuration space. But why should /i„ = rrin exactly! To see this we 
go back to eq. (p5)) . We can always change units and rescale rj and r by some 
constant n into rj ~ kt]' , r — t' /k. If we also rescale (p into (p — ip /k, then 
eqs. (|65l) and (|66l) become 

dtP^-E—daApd''''^') , (69) 



E[p,S] ^ jd^^xp {^[^^{d^Jf + ^^|5^(^- l°g^)'] + ^ 



(70) 



Following the same steps that led to ea. (j87p . we can introduce a different wave 
function ^' = p^/"^ exp{i(j)') which satisfies 

Since the mere rescaling by k can have no physical implications the different 
"regraduated" theories are all equivalent and it is only natural to use the sim- 
plest one: we choose k = {A/By^^ so that p.„/v^ = to„ and we can rescale the 
old /x„ to a new osmotic mass p'^ — p^n^ = rUn- 

The net result is that the non- linear terms drop out. Dropping the prime on 
\1/' and identifying the rescaled value ij' with Planck's constant h, leads to the 
linear Schrodinger equation, 

ih^ = E 7^ Vf,* + y* . (72) 

We conclude that for any positive value of the original coefficients /i„ it is 
always possible to regraduate rj, <j) and /i„ to a physically equivalent but more 
convenient description where the Schrodinger equation is linear and complex 
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numbers attain a special significance. From this entropic perspective the hnear 
superposition principle and the complex Hilbert spaces are important because 
they are convenient, but not because they are fundamental — a theme that was 
also explored in [TU] . 

These considerations remind us of Einstein's original argument for the equiv- 
alence principle: We accept the complete physical equivalence of a gravitational 
field with the corresponding acceleration of the reference frame because this 
offers a natural explanation of the equality of inertial and gravitational masses 
and opens the door to an explanation of gravity in purely geometrical terms. 

Similarly, in the quantum case we accept the complete equivalence of quan- 
tum and statistical fluctuations because this offers a natural explanation of the 
Schrodinger equation — its linearity, its unitarity, the role of complex numbers, 
the equality of inertial and osmotic masses. Furthermore, it opens the door 
to explaining quantum theory as an example of statistical inference — entropic 
dynamics on a suitably non-trivial evolving manifold. 

8 Entropic time vs. "physical" time 

Now that the dynamics has been more fully developed we should revisit the 
question of time. Entropic time has turned out to be useful in ordering the 
inferential sequence of small changes but it is not at all clear that this order 
has anything to do with the order relative to a presumably more fundamental 
"physical" time. If so, why does 'entropic time' deserve to be called 'time' at 
all? 

The answer is that the systems we are typically concerned with include, in 
addition to the particles of interest, also another system that one might call the 
"clock" . The goal is to make inferences about correlations among the particles 
themselves and with the various states of the clock. Whether the inferred se- 
quence of states of the particle-clock composite agrees with the order in "phys- 
ical" time or not turns out to be quite irrelevant. It is only the correlations 
among the particles and the clock that are observable and not their "absolute" 
order. 

This is an idea that demands a more explicit discussion. Here we show how 
it gives rise to the notion of simultaneity that turned out to be central to our 
definition of an instant in section 4.1. 

Consider a single particle. From the probability of a single step, eg. (ITU)) or 
p^ . we can calculate the probability of any given sequence of (short) steps 
{x,xi, . . . ,Xn,. . ■}■ Since the path is an ordered sequence of events when two 
events lie on the same path it is meaningful to assert that one is earlier (in the 
entropic time sense) than the other: Xn is earlier than Xn+i- The actual path, 
however, is uncertain: how do we compare possible events along different paths? 
We need a criterion that will allow us to decide whether an event x' reached 
along one path is earlier or later than another event x" reached along a different 
path. This is where the clock comes in. The role of the clock can be played, 
for example, by a sufficiently massive particle. This guarantees that the clock 
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follows a deterministic classical trajectory xc = xcit) given by eas-lfM]) and 
([55]) and that it remains largely unaffected by the motion of the particle. 

The idea is that when we compute the probability that, say, after n steps the 
particle is found at the point Xn we implicitly assume that its three coordinates 
a:^, and x^ are attained simultaneously. This is part of our definition of 



X. 



1 ^2 



n; 



an instant. We make the same definition for composite systems. In particular, 
for the particle-clock system, x^ = {x^, a;g„), the coordinates of the particle x'^ 
{a = 1,2,3) are taken to be simultaneous with the remaining coordinates that 
describe the clock a;g.„ (a = 4, 5, . . .). Thus, when we say that at the nth step 
the particle is at x'}^ while the clock is at a;g„ it is implicit that these positions 
are attained at the same time. 

By "the time is i" we will just mean that "the clock is in its state xc = 
xc{t)." We say that the possible event that the particle reached x' along one 
path is simultaneous with another possible event x" reached along a different 
path when both are simultaneous with the same state xc {t) of the clock: then 
we say that x' and x" happen "at the same time t." This justifies using the 
distribution p{x, t) as the definition of an instant of time. 

In the end the justification for the assumptions underlying entropic dynamics 
lies in experiment. The ordering scheme provided by entropic time allows one to 
predict correlations. Since these predictions, which are given by the Schrodinger 
equation, turn out to be empirically successful one concludes that nothing deeper 
or more "physical" than entropic time is needed. A similar claim has been 
made by J. Barbour in his relational approach to time in the context of classical 
dynamics [39j . 

9 Dynamics in an external electromagnetic field 

Entropic dynamics is derived from the minimal assumptions that the extra 
variables y are intrinsically uncertain and that motion consists of a succession 
of short steps. These two pieces of information are taken into account through 
the two constraints ([7]) and ^. Special circumstances may however require 
additional constraints. 

9.1 An additional constraint 

Consider a single particle placed in an external field the action of which is to 
constrain the expected component of displacements along a certain direction 
represented by the unit covector na{x). This effect is represented by the con- 
straint 

{^x'^uaix)) = C{x) , (73) 

where the spatial dependence of C{x) reflects the non-uniform intensity of the 
external field. It is convenient to define the magnitude of the external field in 
terms of the effect it induces. Thus we introduce the external field 

^„W„^ (74) 
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and the constraint is 

(Ax'^Aaix)) = C , (75) 

where C is some constant that reflects the strength of the couphng to Aa ■ 

9.2 Entropic dynamics 

The transition probabihty P{x'\x) is that which maximizes the entropy S[P, Q] 
in ^ subject to the old constraints plus the new constraint ([75)) . The result is 

p(x'\x) = ^^ 5(x')-laA£^(x\x)-ffAx°A„(x) ^ (^g) 

where 



and the Lagrange multipher /3 is determined from the constraint eq. (j75|) . 

^logC(x,a,/3) = -C. (78) 

From here on the argument follows closely the previous sections. For large 
a the transition probability (|76p can be written as 

P{x'\x)ocexp[^^^Sab{Ax'^-Ax-){Ax'~Ax')] , (79) 

where we used ([19]), (|46|) . and units have been regraduated to set ?/ = H. There- 
fore, the displacement Ax" can be expressed in terms of a expected drift plus a 
fluctuation, Ax" = AS" + Aw" , where 

{Ax'') = Ax''=b''At where b" = —S''''[dbS ~ l3Ab] , (80) 

(Aw'')=0 and (Aw^Au;'') := — At<5'''' . (81) 

^ ' TO 

Once again, for short steps the dynamics is dominated by the fluctuations. The 
only difference is the replacement of dS by the gauge invariant combination 
dS — (3 A. Small changes accumulate according to the FP equation (p6|) but now 
the current velocity is no longer given by eq.(|40p but rather by 

w" = A(a'»0_/3A°) , (82) 

TO 

and the FP equation is 

p = -da {pv^) - -^d^ipidaCJ) - pAa)] , (83) 

(j) is still given by (|41l) and the osmotic velocity ((38)) remains unchanged. 
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The energy functional is the same as (|45p. but now v is given by eq. 

E= fdxp(^ida(f>~/3Aaf + ^{da\0gpf+v) , (84) 

where we set /i = m and rj = h. 

It is simplest to start with static external potentials, V = and ^ = 0, 
so that the energy is conserved, E ~ 0. Just as before after taking the time 
derivative, integrating by parts, and imposing that E = for arbitrary choices 
of p, we get 

^0+2;^(a.^-/3^.f + ^-^^^O. (85) 

Equations (|83l) and (f85|) are the coupled equations for p and </> that describe 
entropic dynamics in the external potential Aa- 

Setting Shj — il'P ^^id taking the classical limit /i — >■ leads to the classical 
Hamilton- Jacobi equation in an external electromagnetic field showing that the 
Lagrange multiplier /3 plays the role of electric charge. More precisely, 

where e is the electric charge and c is the speed of light. Thus, in entropic 
dynamics electric charge is a Lagrange multiplier that regulates the response to 
the external electromagnetic potential Aa ■ (If desired we can further separate V 
into electric and non-electric components, V = cAq + V', but this is not needed 
for our present purposes.) 

As before, the Schrodinger equation results from combining the functions p 
and (j) into the wave function, 'I' — p^''^ exp{i(j)). Computing the time derivative 
\1/ using eqs.([83|) and ([85|) leads to the Schrodinger equation. 



ih^ = ^i^da - ^Aaf^ + V^ , (87) 

ot 2m he 

The derivation above assumed that energy is conserved, -B = 0, which is true 
when the external potentials are static, ^ = and A = 0, but this limitation is 
easily lifted. For time-dependent potentials the relevant energy condition must 
take into account the work done by external sources: we require that the energy 
increase at the rate 

E= I dxp{V + -pv°-Aa) . (88) 

The net result is that equations (|85p and ([57]) remain valid for time-dependent 
external potentials. 

9.3 Gauge invariance 

We have seen that in entropic dynamics the phase of the wave function receives 
a statistical interpretation, (p = S — logp^''^. On the other hand, without any 
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physical consequences, the phase can be shifted by an arbitrary amount, 

(f>ix, t) -> (i)'(x, t) = (j)[x, t) + I3x{x, t) , (89) 

provided the potential is transformed appropriately, Aa — ?> A'^ — Aa + daX- This 
raises several questions. 

First, how is the statistical interpretation of (j) affected by the possibility of 
gauge transformations? The straighforward answer is that </> reflects a combina- 
tion of several effects — the extra variables (through their entropy S) , the osmotic 
effect of diffusion (through the density p) , and the choice of potential (through 
the function x) — but these separate contributions are not necessarily easy to 
disentangle. Indeed, ea. ([5(I| for the drift velocity shows that the dynamics de- 
pends on S and on A only through the combination dS — 13 A. Therefore we can 
envision two situations that are informationally inequivalent: one is character- 
ized by entropy 5* and constraint {Ax"'Aa) — C, the other by a different entropy 
S' and also by a different constraint {Ax'^A'^) = C . Remarkably they lead to 
exactly the same physical predictions provided the entropies and potentials are 
related by S' — S+Px and A' = A+dx where xi^} t) is some arbitrary function. 
Thus local phase invariance can be interpreted as local entropy invariance. 

A second question was first raised in the context of stochastic mechanics and 
concerns the single- or multi-valuedness of phases and wave functions. Wall- 
strom [33] noted that when stochastic mechanics is formulated d la Nelson [T7] 
the current velocity v is postulated to be the gradient of some locally defined 
function </>. Now, being a local gradient does not imply that v will also be a 
global gradient and therefore both the phases (j) and their corresponding wave 
functions Vf will, in general, be multi- valued — which is unsatisfactory. A possi- 
ble way out is to formulate stochastic mechanics in terms of an action principle 
[20] . Then the current velocity is indeed a global gradient and both phases and 
wave functions are single- valued. But this is a problem too: single- valued phases 
can be too restrictive and exclude physically relevant states. For example, the 
usual way to describe states with non-zero angular momentum is to use multi- 
valued phases (the azimuthal angle) while requiring that the corresponding wave 
functions remain single- valued. 

The same questions can be raised in entropic dynamics and also within the 
standard quantum framework. Why should wave functions be single-valued? 
The answer we favor is essentially the same offered by Pauli in the context of 
standard quantum mechanics [47]. He suggested that the criterion for admissi- 
bility for wave functions is that they must form a basis for a representation of 
the transformation group {e.g., the rotation group) that happens to be pertinent 
to the problem at hand. Pauli's criterion is extremely natural from the perspec- 
tive of a theory of inference: in any physical situation symmetries constitute the 
most common and most obviously relevant pieces of information. 

In entropic dynamics the entropy S{x, t) and the probability density p{x, t) 
are single-valued functions. Therefore, a natural choice is that the phase, 
(j) — S ~ logp^/^, be single- valued too. A situation with non- vanishing angular 
momentum can be handled through a constraint. For example, one can use a 
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single- valued phase and an appropriately chosen vector potential — which might 
perhaps be a pure gauge, Aa = —daX- Alternatively, we can gauge the potential 
away io A'^ ~ Q and use a multi- valued phase, <j}' ~ S — logp^/"^ + j3x- Which 
of these two options is to be preferred depends on whether the goal is clarity of 
interpretation or simpler mathematics. As for the appropriate choice of poten- 
tial, Aa = 9aX: ^^ adopt Pauli's criterion: the admissible wavefunctions — that 
is, the various functions (p, S, x) that appear in the formalism — must form a 
basis for a representation of the pertinent transformation group. 

10 Summary and Conclusions 

Our goal has been to derive quantum theory as an example of entropic inference. 
The challenge is to develop a framework that clarifies the conceptual difficulties 
that have plagued quantum theory since its inception while still reproducing its 
undeniable experimental successes. This means that to the extent that what has 
been derived is quantum mechanics and not some other theory we should not 
expect predictions that deviate from those of the standard quantum theory — at 
least not in the non-relativistic regime discussed in this paper. On the other 
hand, the motivation behind this whole program lies in the conviction that it is 
the clarification and removal of conceptual difficulties that will eventually allow 
us to extend physics to other realms — gravity, cosmology — where the status of 
quantum theory is more questionable. 

The framework of entropic inference is of general applicability. Its applica- 
tion to any particular problem requires assumptions that specify the intended 
subject matter and those pieces of information are considered relevant. The 
main assumptions can be summarized as follows: 

(a) The goal is to predict the positions x of some point particles. Since the 
information available is limited we can at best obtain a probability distribution 
p{x) in the configuration space X. We assume that X is flat, and that it is 
isotropic or anisotropic depending on whether the particles are identical or not. 

(b) We assume that the world includes other things in addition to the particles: 
these extra things are described by variables y that can influence and in turn can 
be influenced by the particles. The uncertainty in the values of y is described 
by distributions p{y\x) in a statistical manifold Ad. The theory is robust in 
the sense that its predictions are insensitive to most details about the extra 
variables. 

(c) We assume that large changes result from the accumulation of many suc- 
cessive short steps. The transition probability for a short step P{x'\x) is found 
using the method of maximum entropy. This requires assumptions about the 
prior (which we take to be uniform) and constraints (that changes happen con- 
tinuously and that after each short step the new p{y'\x') remains within the 
same statistical manifold M). The result is that the dynamics of the particles 
is driven by the entropy S{x) of the extra variables. 

(d) A notion of time is introduced in order to keep track of the accumulation of 
small changes. This requires assumptions about what constitutes an 'instant' 
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and about how time is 'constructed' as a succession of such instants. The 
choice of interval between instants is a matter of convenience — we choose a 
notion of duration that reflects the translational symmetry of the configuration 
space. The result is that the distribution p evolves according to a Fokker-Planck 
equation. 

(e) We assume that the particles react back and affect the entropy S{x) of the 
extra variables in such a way that there is a conserved 'energy' E[p, S] = const. 
The specifics of this interaction are described through the functional form of 
E[p,S]. 

(f ) Electromagnetic interactions are described by including an additional con- 
straint on the expected displacement along a certain field Aa{x). 

No further assumptions are made. 

The statistical model is specified by several parameters, {(t'^^, t, A, B, /3}. The 
anisotropy of configuration space for non-identical particles is parametrized by 
fT^ with n = 1 . . . A^; r defines units of time; A and B parametrize the relative 
strengths of the current and osmotic terms in the energy functional; and, finally, 
[3 is the Lagrange multiplier associated to the electromagnetic constraint. These 
parameters can be suitably regraduated and combined with each other into the 
familiar set which includes the masses and charges of the particles and Planck's 
constant. 

We conclude with a summary of our conclusions. 
On epistemology vs. ontology: Quantum theory has been derived as an ex- 
ample of entropic dynamics. In this model "reality" is reflected in the positions 
of the particles and the values of the extra variables, and our "limited infor- 
mation about reality" is represented in the probabilities as they are updated to 
reflect the physically relevant constraints. 

Quantum non-locality: Entropic dynamics may appear classical because no 
"quantum" probabilities were introduced. But this is deceptive. Probabilities, 
in this approach, are neither classical nor quantum; they are merely tools for 
inference. Phenomena that would normally be considered non-classical, such as 
non-local correlations, emerge naturally from constraints in configuration space 
which include the osmotic or, equivalently, the quantum potential terms in the 
energy functional. 

The presence of a quantum potential may suggest a connection between our 
(epistemological) entropic dynamics and Bohm's ontological interpretation j48j . 
There is none. It is true that both theories agree on the same Schrodinger 
equation, and therefore on the same modified version of the Hamilton-Jacobi 
equation. But Bohmian mechanics is meant to reflect "reality" ; particles are 
supposed to follow smooth causal trajectories along the gradient of the phase, 
V0. In contrast, entropic dynamics reflects information; particles follow non- 
differentiable trajectories and it is the probability distribution p, not the parti- 
cles, that evolves along W(f>. 

On interpretation: Ever since Born the magnitude of the wave function 
|\['P — p has received a statistical interpretation. Within the entropic dynamics 
approach the phase of the wave function is also recognized as a feature of purely 
statistical origin. When electromagnetic interactions are introduced the gauge 
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invariance is interpreted as an invariance under local entropy transformations. 
On dynamical laws: The principles of entropic inference form the backbone of 
this approach to dynamics. The requirement that an energy be conserved is an 
important piece of information (i.e., a constraint) which will probably receive 
its full justification once a completely relativistic version of entropic dynamics 
is developed. 

On time: The derivation of laws of physics as examples of inference requires 
an account of the concept of time. Entropic time is modelled as an ordered 
sequence of instants with the natural measure of duration chosen to simplify the 
description of motion. We argued that whether the entropic order agrees with 
an objective order in an external "physical" time turns out to be an empirically 
inaccessible question, and in this sense, the notion of a "physical" time is not 
needed. Most interestingly, the entropic model of time explains the arrow of 
time. 

Equivalence principle: The derivation of the Schrodinger equation from en- 
tropic inference led to an interesting analogy with general relativity. The sta- 
tistical manifold M is not a fixed background but actively participates in the 
dynamics. 
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Appendix 

We claim that the distribution P{x' ,y'\x) given by (fTO| and ^ yields the ab- 
solute global maximum of the relative entropy 5[P, Q] subject to the constraint 
([5]). Substituting ([TU)) into © the conjectured maximum value is 

S[P, Q] = \a{M^) - log(C7'/') = 5„,ax . (90) 

On the other hand, the entropy /C[P, P] of any distribution P{x' ^y'\x) relative 
to P{x' ,y'\x) satisfies the Gibbs inequality pp], 

/C[P, P] = - I dx'dy' P{x', y'\x) log ^^J^J^TT ^ ^ ' (^1) 

J r[X , y \X) 

with equality if and only if P = P. Introducing the prior Q and rearranging we 
get 

S[P,Q] < - J dx'dy' Pix',y'\x)log^^;^^ . (92) 
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We require that P satisfy the same constraints (O and (|H]) and use PH)) to get 

Pix'\x)p{y'\x') 



S[P,Q] < ~ f dx'dy' P{x'\x)p{y'\x') log 



7l/2q(y) 



dx'P{x'\x)S{x')- I dx' P{x'\x) log ^^""Jf ^ 5n,ax -(93) 

Therefore any P{x',y'\x) satisfying the same constraints as P{x',y'\x) lowers 
the relative entropy. 
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